Payoff-based Inhomogeneous Partially Irrational Play for Potential Game Theoretic Cooperative Control of Multi-agent Systems

Authors

  • Tatsuhiko Goto
  • Takeshi Hatanaka
  • Masayuki Fujita
Abstract

This paper addresses a class of strategic games called potential games and develops a novel learning algorithm, Payoff-based Inhomogeneous Partially Irrational Play (PIPIP). The algorithm builds on Distributed Inhomogeneous Synchronous Learning (DISL), presented in an existing work, but unlike DISL, PIPIP allows agents to make irrational decisions with a specified probability; that is, an agent may choose a low-utility action from among the past actions stored in its memory. Thanks to these irrational decisions, we can prove that the collective actions converge in probability to potential function maximizers. Finally, we demonstrate the effectiveness of the algorithm through experiments on a sensor coverage problem. The experiments show that the learning algorithm successfully leads agents to the neighborhood of potential function maximizers even in the presence of undesirable Nash equilibria. An experiment with a moving density function further shows that PIPIP adapts to environmental changes.
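As a rough illustration of the idea described in the abstract, the following Python sketch shows one agent's decision step. The abstract does not state the update rule, so everything here is an assumption made for illustration: the function name `pipip_like_choice`, the two-action memory, the exploration rate `eps`, the scale `kappa`, and in particular the form `kappa * eps ** gap` for the probability of an irrational decision. It is not the authors' algorithm as published.

```python
import random


def pipip_like_choice(a_prev, u_prev, a_prev2, u_prev2, actions, eps, kappa, rng):
    """Pick an agent's next action from its two remembered actions, or explore.

    Hypothetical sketch: a_prev/u_prev are the most recent action and its
    utility, a_prev2/u_prev2 the one before. The irrational-choice
    probability below is an assumed form, not taken from the abstract.
    """
    # Exploration: with probability eps, try a uniformly random action.
    if rng.random() < eps:
        return rng.choice(actions)

    # The most recent action is at least as good as the older one: keep it.
    if u_prev >= u_prev2:
        return a_prev

    # The most recent action is worse. A rational agent would revert to the
    # older action; here the agent may irrationally keep the worse one with
    # a probability that shrinks as the utility gap grows (assumed form).
    gap = u_prev2 - u_prev
    p_irrational = min(1.0, kappa * eps ** gap)
    if rng.random() < p_irrational:
        return a_prev
    return a_prev2


if __name__ == "__main__":
    rng = random.Random(0)
    nxt = pipip_like_choice("stay", 0.2, "north", 0.7,
                            ["stay", "north", "south"], 0.1, 0.5, rng)
    print(nxt)
```

Setting `kappa = 0` in this sketch removes the irrational branch, so the agent always falls back to the better remembered action unless it explores. That comparison is only meant to convey the contrast the abstract draws with DISL.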

Similar resources

A Game Theory Approach for Solving the Knowledge Sharing Problem in Supply Chain

Knowledge management is the process of capturing, developing, sharing, and effectively using organizational knowledge. The knowledge management literature emphasizes the importance of knowledge as a valuable asset for SMEs. This paper highlights the efficient sharing of knowledge as a way of creating core competencies in SMEs engaged in civil construction activities. This perspectiv...

Cooperative Control of Mobile Robots in Creating a Runway Platform for Quadrotor Landing

Multi-agent systems are systems in which several agents accomplish a mission in a cooperative manner. In this paper, a novel idea for the construction of a movable runway platform based on multi-agent systems is presented. It is assumed that an aerial agent (quadrotor) decides to make an emergency landing due to reasons such as a decrease in energy level or technical failure, while there is no ...

State based potential games

There is a growing interest in the application of game theoretic methods to the design and control of multiagent systems. However, the existing game theoretic framework possesses inherent limitations with regard to these new prescriptive challenges. In this paper we propose a new framework, termed state based games, which introduces an underlying state space into the game theoretic environmen...

Stochastic cooperative advertising in a manufacturer–retailer decentralized supply channel

This work considers cooperative advertising in a manufacturer–retailer supply chain in which the manufacturer is the Stackelberg leader and the retailer is the follower. Using the Sethi model, it captures the dynamic effect of the manufacturer's and the retailer's advertising efforts on sales. It uses optimal control techniques and stochastic differential game theory to obtain the players’ advertising strategies a...

A TRANSITION FROM TWO-PERSON ZERO-SUM GAMES TO COOPERATIVE GAMES WITH FUZZY PAYOFFS

In this paper, we deal with games with fuzzy payoffs. We prove that players who are playing a zero-sum game with fuzzy payoffs against Nature are able to increase their joint payoff, and hence their individual payoffs, by cooperating. It is shown that a cooperative game with the fuzzy characteristic function can be constructed via the optimal game values of the zero-sum games with fuzzy payoff...

Journal title:
  • CoRR

Volume abs/1107.4838  Issue

Pages  -

Publication date 2011